- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources6
- Resource Type
-
0004000002000000
- More
- Availability
-
60
- Author / Contributor
- Filter by Author / Creator
-
-
Bastani, Osbert (5)
-
Pu, Yewen (5)
-
Rinard, Martin (5)
-
Solar-Lezama, Armando (5)
-
Yang, Yichen (4)
-
Inala, Jeevana Priya (3)
-
Alonzo, Michael (1)
-
Baker, Matthew (1)
-
Inala, Jeevana P. (1)
-
Kumar, Vijay (1)
-
Locke, Dexter Henry (1)
-
Murphy-Dunning, Colleen (1)
-
O'Neil-Dunne, Jarlath P.M. (1)
-
Paulos, James (1)
-
Priya, Jeevana I (1)
-
Yang, Yichen D. (1)
-
Yang, Yichen David (1)
-
Ziter, Carly D. (1)
-
#Tyler Phillips, Kenneth E. (0)
-
#Willis, Ciara (0)
-
- Filter by Editor
-
-
null (1)
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Yang, Yichen; Inala, Jeevana Priya; Bastani, Osbert; Pu, Yewen; Solar-Lezama, Armando; Rinard, Martin (, Advances in neural information processing systems)A key challenge for reinforcement learning is solving long-horizon planning problems. Recent work has leveraged programs to guide reinforcement learning in these settings. However, these approaches impose a high manual burden on the user since they must provide a guiding program for every new task. Partially observed environments further complicate the programming task because the program must implement a strategy that correctly, and ideally optimally, handles every possible configuration of the hidden regions of the environment. We propose a new approach, model predictive program synthesis (MPPS), that uses program synthesis to automatically generate the guiding programs. It trains a generative model to predict the unobserved portions of the world, and then synthesizes a program based on samples from this model in a way that is robust to its uncertainty. In our experiments, we show that our approach significantly outperforms non-program-guided approaches on a set of challenging benchmarks, including a 2D Minecraft-inspired environment where the agent must complete a complex sequence of subtasks to achieve its goal, and achieves a similar performance as using handcrafted programs to guide the agent. Our results demonstrate that our approach can obtain the benefits of program-guided reinforcement learning without requiring the user to provide a new guiding program for every new task.more » « less
-
Yang, Yichen; Inala, Jeevana Priya; Bastani, Osbert; Pu, Yewen; Solar-Lezama, Armando; Rinard, Martin (, Advances in neural information processing systems)
-
Yang, Yichen David; Inala, Jeevana Priya; Bastani, Osbert; Pu, Yewen; Solar-Lezama, Armando; Rinard, Martin (, Advances in neural information processing systems)
-
Yang, Yichen D.; Inala, Jeevana P.; Bastani, Osbert; Pu, Yewen; Solar-Lezama, Armando; Rinard, Martin (, Advances in neural information processing systems)
-
Priya, Jeevana I; Yang, Yichen; Paulos, James; Pu, Yewen; Bastani, Osbert; Kumar, Vijay; Rinard, Martin; Solar-Lezama, Armando (, Advances in neural information processing systems)null (Ed.)
An official website of the United States government

Full Text Available